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Attempts  to  predict  academic  success  of  college 
students  by  traditional  methods  have  failed  to  produce  the 
kinds  of  information  most  needed  by  counselors.  Data  from 
an  existing  study  were  used  in  searching  for  effective 
methods  of  a less  traditional  nature.  Comparisons  were  made 
of  the  predictive  efficiency  of  three  species  of  profile 
analytic  measures  and  one  traditional  variable  specie.  The 
four  species  included  shape  measures,  distance  measures,  a 
type  variable  composed  of  successive  use  of  the  preceding 
two  methods,  and  factor  scores.  The  multiple  regression 
model  was  used,  with  cross-validation  to  compensate  for 
shrinkage.  A part  of  the  cross-validation  procedure  re- 
quired much  effort  in  revising  one  method  of  extending  fac- 
tor matrices . 

It  was  found  that  profile  measures  involving  only  shape 
or  distance  were  inferior  to  the  method  devised  by  Guertin, 
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which  used  a successive  combination  of  shape  analysis  and 
distance  analysis  (type  variables) . Results  were  inconclu- 
sive concerning  relative  effectiveness  of  the  type  variables 
and  the  factor  scores  as  the  factor  scores  were  better  pre- 
dictors in  the  majority  of  instances,  but  only  by  a slim 
margin . 

It  was  recognized  from  existing  studies  that  the 
qualitative  information  available  with  Guertin's  method  is 
sufficiently  valuable  to  be  an  end  in  itself.  Therefore 
the  possibility  of  the  method's  use  to  provide  continuous 
variables  is  a potentially  valuable  by-product  that  could 
be  developed  to  further  enhance  the  technique.  Re-creation 
of  the  variables  with  subsequent  subsamples  can  be  done 
quite  economically,  making  additional  research  quite  feasible. 

Suggestions  were  made  concerning  practical  applications 
of  the  findings  in  relation  to  current  education  problems. 
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CHAPTER  1 


INTRODUCTION 

The  enlargement  of  the  scope  of  public  post  high 
school  education  with  its  accompanying  costs , time  and 
money  from  the  student  as  well  as  money  from  the  taxpayer, 
signals  a need  for  better  tools  for  effective  counseling 
and  academic  placement.  The  status  of  current  techniques 
is  evident  from  the  attitude  of  those  student  personnel 
practitioners  who  point  to  the  futility  of  attempting  to 
predict  who  "can"  and  who  "cannot"  succeed  academically. 

The  answer  they  seem  to  seek  is  who  will  and  who  will  not 
succeed. 

Attempts  to  predict  academic  success  of  college  students 
by  more  traditional  methods  have,  on  the  whole,  failed  to 
produce  sufficient  significant  information  for  use  by 
counselors  (Keenen  and  Holmes,  1970).  That  relationships 
actually  exist  between  a variety  of  measuring  devices,  used 
singly  or  in  groups,  and  the  academic  progress  of  students 
is  a well-documented  fact.  Predictions  can  be  made  with  a 
degree  of  success  for  groups  of  students  (Fishman  and 
Pasanella,  1960;  Lavin,  1965;  Clarke  and  Ammons,  1970),  but 
the  methods  used  are  unsuccessful  for  prognostication  with 
individuals . 
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Efforts  by  behavioral  scientists  to  formulate  lav/s 
by  which  human  behavior  can  be  predicted  would  seem  to  be 
limited  not  only  by  the  adequacy  of  methods  for  testing 
theories  (hypotheses)  but  also  by  methods  of  actually  build- 
ing those  theories.  According  to  Spence  (1960),  one  does 
not  have  a scientific  theory  until  constructs  are  introduced 
with  precision,  for  this  is  the  only  way  in  which  they  can 
be  verified  or  rejected.  Spence  pointed  out  that  theory 
building  is  a necessity  because  knowledge  of  specific  events 
does  not  allow  a basis  for  prediction  when  the  situation 
changes  in  the  slightest  degree.  He  postulated  (Spence, 

1960,  p.  32)  that  the  higher  the  level  of  abstraction  that 
can  be  obtained,  the  higher  the  level  of  understanding  that 
can  be  achieved.  Such  theoretical  constructs  (abstractions) 
would  seem  to  be  synonymous  with  the  definition  of  inter- 
vening variables,  first  proposed  by  Tolman  (1938)  and  refined 
by  others,  including  Hull  (1943)  and  Spence  (1942). 

The  basic  question  to  be  considered  here  is  whether 
more  accurate  prediction  is  achieved  through  computation 
with  a simple  combination  of  individual  variables  (first- 
order  variables)  or  whether  such  predictions  can  be  improved 
through  use  of  empirical  constructs  (intervening  variables) 
such  as  traits  or  types  derived  through  mathematical  com- 
putations. The  latter  would  constitute  higher-order  vari- 
ables, which,  having  been  derived  from  combinations  of 
variables  of  the  first  order,  would  of  course  be  at  a higher 
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level  of  abstraction.  In  order  to  explore  the  question  it 
is  necessary  to  express  constructs  quantitatively,  prefer- 
ably as  continuous  data.  Because  several  alternative  con- 
structs can  be  derived  from  the  same  data  level  variables, 
it  is  important  that  more  than  one  be  investigated  with  the 
same  data  set.  Differential  prediction  for  subgroups, 
whereby  each  student's  profile  of  test  results  would  be 
analyzed  and  his  chances  for  "success"  projected  on  the 
basis  of  the  performance  of  students  with  whom  his  profile 
was  congruent,  seems  to  offer  the  best  possibility  for  such 
exploration  at  this  time.  Accordingly,  the  present  study 
was  designed  to  compare  the  relative  effectiveness  of  test 
items  and  scores  with  several  profile  analysis  measures 
when  they  are  used  as  predictors  in  multiple  regression 
analysis . 

Review  of  the  Literature 

Though  many  educators  claim  to  make  diagnoses  from  pro- 
files, the  usual  data  available  to  them  are  merely  profiles 
of  the  means  of  each  variable  for  sample  groups.  Frequently 
there  is  insufficient  information  concerning  the  variance  of 
the  scores.  Guertin  and  Bailey  (1970)  hold  that  only  modal 
patterns,  hypothetical  profiles  of  test  scores  character- 
istic of  a number  of  individuals  in  a diagnostic  group, 
should  be  used  for  diagnostic  purposes.  The  term  profile 


is  used  here  to  refer  to  a set  of  scores  obtained  by  a 
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particular  individual  on  a given  set  of  measuring  devices. 
The  term  pattern  refers  to  the  profile  (configuration)  of 
mean  scores  of  a group  of  individuals  who  have  been  found 
to  have  congruent  profiles. 

The  literature  concerning  techniques  for  isolating  sub- 
groups according  to  similarity  of  profiles  is  somewhat 
sparse  considering  the  emphasis  on  importance  of  patterns 
by  many  persons  in  the  social  sciences . According  to 
McQuitty  (1961) , theories  of  the  importance  of  patterns  fail 
to  be  either  well-substantiated  or  rejected,  partially  due 
to  the  lack  of  adequate  statistical  methods  for  the  isola- 
tion of  patterns.  Those  methods  have  been  influenced  by 
three  aspects  of  the  problem  (Nunnally,  1962) : 

A.  How  can  the  degree  of  similarity  of  a pair  of 
profiles  be  measured? 

B.  What  is  the  best  technique  to  differentiate  the 
patterns  of  several  groups? 

C.  How  can  profiles  be  clustered  into  homogeneous 
groups? 

Additionally,  Osgood  and  Suci  (1952)  have  pointed  out  the 
importance  of  the  question  of  whether  profile  similarity 
should  be  judged  on  the  basis  of  shape,  level,  scatter,  or 
some  combination  of  the  three. 

Comparisons  of  the  methods  of  profile  analysis  avail- 
able in  the  early  1950s  (Cronbach  and  Gleser,  1953;  Gaier 
and  Lee,  1953)  describe  procedures  that  can  be  divided  into 
correlation  measures  and  distance  measures.  A correlation 
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method  of  profile  grouping  presented  by  Stephenson  (1936 ) 
involved  correlation  of  people  rather  than  of  the  tests  in 
a matrix.  Analysis  of  the  resulting  interperson  correlation 
matrix  was  called  transpose  factor  analysis  by  Burt  (1937) . 
Factor  analysis  of  the  transposed  matrix  was  termed  Q- 
technique  by  Cattell  (1952). 

The  distance  measures  include  methods  devised  by 
Pearson  (1928)  , Mahalanobis  (1936)  , Cattell  (1949)  , Osgood 
and  Suci  (1952),  Cronbach  and  Gleser  (1953),  and  duMas 
(1949) . Cronbach  and  Gleser  (1953)  devised  a modification 
of  the  generalized  distance,  or  d square,  of  Mahalanobis 
(1936) . Their  revision  seemed  to  have  merit  for  computa- 
tions with  psychological  data  (Roulon,  Tiedemen,  Tatsuoka, 
and  Langmuir,  1967)  . Their  measure,  d,  is  the  square  root 
of  the  summation  of  squared  distances  between  each  pair  of 
orthogonal  variables  of  two  profiles. 

Later  examples  of  techniques  utilizing  distance  mea- 
sures include  McQuitty's  (1961)  typal  analysis  and  the 
hierarchial  grouping  techniques  developed  by  Cattell  and 
Coulter  (1966)  and  by  Ward  (1963) . Modifications  of  the 
latter  method  have  been  developed  by  Rock  and  Barone  (1967) 
and  have  been  used  by  Rock  and  others  (Klein,  Rock,  and 
Evans,  1968;  Flauger  and  Rock,  1969)  and  by  Schoenfeldt 
(1970)  to  investigate  the  predictive  aspects  of  subgroups 
formed  from  biographic  data. 
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Criticisms  of  the  correlation  techniques  (Cronbach  and 
Gleser,  1953;  Gaier  and  Lee,  1953;  Guertin,  1966)  point  out 
that  they  pay  attention  to  shape  while  ignoring  level  and 
scatter.  Guertin  (1966)  pointed  out  that  among  the  problems 
of  distance  measures,  two  profiles  with  opposite  shapes  may 
yield  a small  d square  when  they  are  of  the  same  level  and 
the  variance  is  small.  He  developed  a complex  system  in- 
volving factor  analysis  of  both  correlation  and  distance 
measures  to  circumvent  the  shortcomings  of  each  method  when 
used  independently.  The  procedure  as  it  is  now  developed 
(Guertin  and  Bailey,  1970)  seems  particularly  valuable 
because  it  involves  consideration  of  shape,  level,  and 
variance . 

Seay  (1970)  utilized  Guertin' s method  in  a study  which 
examined  data  collected  for  over  five  hundred  first  grade 
pupils  with  the  Orange  County  Inferred  Self-Concept  Scale. 
The  technique  established  10  male  and  10  female  patterns 
from  a sample  of  the  subjects.  All  profiles  were  compared 
to  each  pattern  of  the  appropriate  sex  and  a d square  mea- 
sure of  congruence  was  obtained  at  each  comparison.  The 
resulting  d square  measures  were  analyzed.  Among  the  find- 
ings were  differences  in  male  and  female  self-concept 
patterns.  Seay  did  not,  however,  attempt  to  generalize  to 
any  other  population. 

Elliott  (1970)  also  utilized  Guertin ' s method  in  iso- 
lating subgroups  of  teachers  and  students  in  a Florida  high 


7 


school.  He  designed  two  instruments,  one  for  teachers 
and  one  for  students,  both  of  which  contained  questions  of 
a biographic  nature  and  questions  concerning  teaching  theory 
and  practices.  The  analysis  resulted  in  the  identification 
of  four  student  patterns  and  two  teacher  patterns.  D square 
measures  of  congruence  were  calculated  for  each  profile  with 
each  pattern.  Product-moment  correlations  were  then  calcu- 
lated between  those  measures  (by  pattern)  with  the  student 
ranking  of  preference  for  each  teacher.  Out  of  204  corre- 
lations only  16  were  significant  at  the  .05  level.  This  (16 
out  of  204)  was  not  significantly  better  than  chance;  there- 
fore the  student  patterns  could  not  be  considered  really 
predictive  (modal) . 

Chang  (1971)  examined  several  approaches  to  profile 
analysis  in  comparing  their  relative  efficiency  for  grouping 
the  50  states  of  the  Union.  She  used  measures  derived 
through  shape  analysis,  distance  analysis,  conventional  R 
analysis,  and  Guertin's  method  combining  the  first  two  tech- 
niques. Chang  concluded  that  Guertin's  profile  analysis 
technique  yielded  information  that  was  unavailable  from  the 
other  analyses , and  therefore  it  was  particularly  valuable 
when  it  is  necessary  to  go  beyond  the  mere  derivation  of 
groups  and  explain  the  bases  (characteristics)  on  which  the 
groups  were  formed. 
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Statement  of  the  Problem 

The  present  study  represents  an  effort  to  determine 
whether  profile  analysis  can  be  used  to  derive  variables 
for  prediction  purposes.  Further,  it  is  an  attempt  to  show 
whether  more  efficient  predictor  variables  can  be  derived 
from  shape  measures  (Q  analysis),  distance  measures,  or  from 
a combination  of  the  two.  The  problem  questions  were 
written  so  as  to  indicate  an  exploratory  study  of  method- 
ologies rather  than  one  in  which  the  accuracy  of  hypotheses 
is  adjudged.  Those  questions  are: 

1.  Can  profile  analysis  be  used  to  provide  input 
variables  for  multiple  regression  analysis  of 
academic  success  measures? 

2.  If  so,  are  the  most  effective  variables  produced 
through  analysis  of  shapes,  distance,  or  a succes- 
sive combination  of  the  two? 

3.  When  used  to  predict  success,  are  variables  derived 
from  profile  measures  superior  to  variables  com- 
posed of  test  items  or  test  factor  scores? 

The  study  is  strictly  limited  to  consideration  of  the  pre- 
dictive efficiency  of  variables  produced  by  the  various 
methods,  and  does  not  investigate  characteristics  of  the 
subgroups  isolated  by  profile  analysis. 

Data  from  a previous  study  (Clarke  and  Ammons,  1970) 
were  reexamined  according  to  the  questions  listed  above. 

That  study  reported  evidence  of  the  importance  of  a 
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combination  of  affective  and  cognitive  measures  in  pre- 
dicting grade  point  average  (GPA)  in  the  first  semester  of 
enrollment  for  first-time-in-college  graduates  of  Florida 
high  schools  who  entered  St.  Petersburg  Junior  College  in 
August,  1967.  Utilizing  the  multiple  regression  model  with 
conventional  variables,  the  investigators  attempted  to 
predict  differentially  for  students  subdivided  according  to 
race  and  sex.  The  amount  of  variance  accounted  for  ranged 
from  19%  for  the  white  male  subgroup  to  50%  for  the  black 
female  subgroup  (see  Appendix,  Table  A). 


CHAPTER  2 


PROCEDURE 

The  problem  of  comparing  predictive  effectiveness  of 
profile  variables  derived  either  from  shape  analysis  or  from 
distance  analysis  with  that  of  variables  derived  from  a 
successive  combination  of  the  two  techniques  places  certain 
restrictions  on  the  particular  methods  chosen  for  the  inves- 
tigation. For  a realistic  comparison  of  the  third  technique 
with  the  other  two,  it  is  mandatory  that  its  respective 
modules  (shape  and  distance  measures)  be  compatible  with  the 
shape  technique  and  the  distance  technique  which  are  used 
independently.  Guertin's  profile  analysis  technique  is 
ideally  suited  to  this  purpose,  as  the  shape  and  distance 
modules  are  available  as  separate  computer  programs  of  his 
Education  Evaluation  Library,  hereafter  referred  to  as  EEL 
(Guertin  and  Bailey,  1970) . The  Profile  Analysis  Package 
(EEL  777)  as  it  currently  exists  is  a multistage  system 
whereby  a test  score  matrix  is  transposed  to  produce  a 
between-persons  correlation  matrix  which  is  factor  analyzed 
by  the  principal  axes  method  and  rotated  to  the  varimax 
criterion.  The  factors  thus  produced  yield  groups  of 
individuals  whose  profiles  are  similar  in  shape.  Each  shape 
factor  is  examined  separately  to  identify  clusters  of 
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profiles  whose  level  and/or  variability  are  similar.  This 
is  accomplished  by  comparing,  individually,  each  profile 
that  is  heavily  loaded  on  the  shape  factor  with  every  other 
profile  in  the  shape  factor  to  produce  a matrix  of  d mea- 
sures. The  difference  between  each  of  the  corresponding 
points  of  the  profile  and  pattern  is  determined,  then 
squared.  The  squared  difference  values  are  summed  across 
all  variables  for  the  pair,  and  the  square  root  is  taken. 

The  matrix  thus  produced  is  one  of  dissimilarity  indices, 
and  is  converted  to  a matrix  of  similarity  indices  by  sub- 
tracting each  element  from  the  largest  element  of  the  matrix. 
Then  each  element  of  the  resultant  matrix  is  divided  by  the 
largest  element  of  that  matrix  to  give  values  ranging  down- 
ward from  unity.  The  largest  element  in  each  column  of  the 
final  matrix  is  inserted  in  the  diagonal  as  an  initial 
"communality " estimate.  This  matrix  is  factor  analyzed 
(principal  axes)  and  rotated  to  the  varimax  criterion  to 
locate  profiles  which  are  similar  in  level  and  variability. 

By  comparing  only  members  loading  on  the  same  shape  factor, 
this  approach  avoids  the  major  problem  of  most  "distance" 
methods,  that  of  obtaining  a moderately  high  similarity 
index  from  two  dissimilarly  shaped  profiles  at  the  same 
level . 

Each  column  of  the  d factor  matrix  is  searched  in  turn 
for  profiles  loading  on  it.  Two  profiles  must  load  heavily 
before  it  can  be  used  to  determine  a modal  pattern  (Guertin 
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Bailey,  1970,  pp.  261,  284,  292).  The  pattern  values  are 
computed  by  weighting  scores  of  the  pivot  profiles  (those 
profiles  loading  heavily  on  the  d factor)  proportionately 
to  the  size  of  their  d factor  loading  and  obtaining  the  mean 
value.  The  procedure  is  then  repeated  for  the  other  column (s) 
in  the  d factor  matrix. 

The  shape  factor  module  of  the  profile  analysis  package 
is  available  separately  as  EEL  504,  and  the  distance  factor 
module  is  available  as  EEL  510. 

Because  of  the  foregoing  considerations,  the  four 
methods  chosen  to  derive  variables  for  comparison  of  pre- 
dictive effectiveness  were  shape  factors  via  EEL  504, 
distance  factors  via  EEL  510,  types  via  EEL  777,  and  factor 
scores  (representing  R analysis)  via  EEL  518  (from  factors 
derived  by  EEL  501) . 

For  the  first  two  methods,  both  transpose  procedures, 
the  subject's  loadings  on  each  factor  were  considered  to  be 
variables  for  this  study.  Type  analysis  variables  were  the 
d's  for  each  subject,  comparing  him  with  each  pattern.  Of 
course,  the  factor  scores  themselves  were  used  as  conven- 
tional variables  (R  analysis) . 

All  of  the  above  species  of  variables  should  be  con- 
sidered as  being  of  a higher  order,  having  been  derived  from 
the  instruments  originally  used  by  Clarke  and  Ammons  (1970) 
in  the  fall  of  1967. 
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The  instruments  were  administered  to  all  of  the  students 
in  the  study  before  or  during  the  fall  registration  period. 

Two  test  batteries  (1  and  2 below)  measuring  academic  skills 
had  been  administered  prior  to  enrollment.  Three  other 
tests  (3,  4,  and  5 below),  all  concerned  with  the  affective 
domain,  were  administered  to  each  student  during  registra- 
tion. The  instruments  were: 

1.  The  Florida  Twelfth  Grade  Statewide  Testing  Program 
(FTGSTP) — this  restricted  battery,  designed  by  the 
Educational  Testing  Service  and  administered  to  all 
Florida  twelfth  grade  students  in  the  fall  of  their 
senior  year,  includes  measures  indicating  achieve- 
ment level  in  General  Aptitude,  English,  Social 
Sciences,  Natural  Science,  and  Mathematics. 

2.  School  and  College  Ability  Test  (SCAT) --two  subscores 
Linguistic  and  Quantitative,  and  a Total  are  reported 

3.  How  I See  Myself  (HISM) — this  self-concept  scale 
normally  yields  attitude  scores  in  nine  areas: 
Teacher-School,  Physical  Appearance,  Interpersonal 
Adequacy,  Autonomy,  Physical  Adequacy,  Emotions, 
Boy-Social,  and  Girl-Social.  There  are  42  items  in 
the  test  (Gordon,  1968) . 

4.  Social  Reaction  Inventory  (SRI) — an  unpublished  adap- 
tation of  this  instrument  was  utilized.  The  inven- 
tory yields  a single  raw  score  which  indicates 
locus  of  control  (internal  vs.  external)  as  per- 
ceived by  the  examinee  (Rotter,  1966) . 
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5.  Study  of  Values  (AVL) — this  instrument  indicates 
the  relative  strength  of  student  values  in  the 
following  areas:  Theoretical,  Economic,  Aesthetic, 

Social,  Political,  Religious  (Allport,  Vernon,  and 
Lindzey,  1960) . 

The  original  criterion,  first  semester  grade  point 
average,  included  all  credit  courses  taken  by  the  student. 

In  the  original  analysis  of  the  data  only  subtotal  and 
total  scores  were  utilized.  Responses  to  individual  test 
items  were  not  included.  However,  test  protocols  for  all 
instruments  except  the  Florida  Statewide  Twelfth  Grade 
Testing  Program  were  still  available,  noncognitive  instru- 
ments having  been  identified  by  code  rather  than  by  name  to 
preserve  confidentiality. 

The  response  method  of  How  I See  Myself,  a 5-point 
scale,  made  it  appropriate  to  include  the  individual  items 
as  variables  in  a profile  analysis.  However,  the  Study  of 
Values  consists  of  several  different  parts,  each  with  its 
particular  system  of  marking  choices,  and  the  Social 
Reaction  Inventory  items  require  a binary  response.  Neither 
instrument  was  considered  suitable  for  yielding  individual 
items  for  the  analysis.  Thus , the  study  included  the  follow- 
ing 56  first-  and  second-order  variables:  the  42  items  of 

How  I See  Myself,  the  five  subscores  of  the  Florida  State- 
wide Twelfth  Grade  Testing  Program,  the  two  subscores  of  the 
School  and  College  Ability  Test,  the  total  score  of  the 
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Social  Reaction  Inventory,  the  six  subscores  of  the  Study  of 
Values . 

It  was  decided  to  eliminate  from  the  sample  any  case 
for  whom  data  were  incomplete.  Because  variable  means  pre- 
viously had  been  substituted  for  missing  observations  in 
the  original  study  the  sample  size  was  reduced  considerably. 
Additionally,  it  was  decided  to  limit  the  study  to  the  white 
subgroups  since  the  number  of  blacks  was  too  small  to  pro- 
duce meaningful  results  if  considered  separately  and  was  too 
large  to  add  to  error  variance  with  equanimity.  Thus,  the 
present  study  included  a total  of  812  white  males  and  a 
total  of  649  white  females. 

The  work  was  done  in  four  phases , as  described  graphi- 
cally in  Figure  1.  First,  all  data  from  the  original  study 
were  retrieved.  Cumulative  records  were  searched  and  two 
additional  criterion  variables  were  included  for  each  case. 
They  were  the  number  of  semester  hours  of  credit  attempted 
over  two  calendar  years  and  the  accumulated  GPA  over  two 
calendar  years. 

Transformation  to  z scores  was  accomplished,  and  two 
subsamples  were  drawn  for  each  sex  (see  Figure  2) . The 
second  phase  (see  Figure  3)  utilized  the  first  subsamples 
for  development  of  the  higher-order  variables  and  determina- 
tion of  the  predictive  equations  based  on  each  species  of 
variable.  This  was  done  separately  for  each  sex.  Phase 
three  (see  Figures  4 and  5)  consisted  of  cross-validation  of 
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Figure  1.  Overview  of  the  procedures. 
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Figure  2.  Score  transformation  and  sampling. 
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Figure  3.  Creation  of  variables  and  development  of  regression  equation 
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Figure  5.  Cross-validation  and  comparison  of  methods 
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all  equations  with  the  second  subsamples,  while  the  last 
phase  involved  a comparison  of  the  variable  species  based 
on  their  effectiveness  as  predictors  (included  in  Figure 
5)  . 

Phase  One 

The  original  test  data  for  all  subjects  were  obtained 
and  examined  for  completeness.  All  cases  with  missing 
observations  were  deleted. 

Scores  on  the  FTGSTP  were  originally  reported  as  per- 
centiles based  on  statewide  norms  and  therefore  were  con- 
verted to  z scores  via  a revised  version  of  EEL  588, 
Percentiles  to  z Scores.  Several  cases  (five  males,  six 
females)  had  one  subscore  indicated  as  00,  a condition 
which  though  theoretically  impossible,  does  appear  in  prac- 
tice with  this  particular  battery.  Such  a score  has  been 
known  to  represent  either  a missing  observation  or  an 
extremely  poor  test  performance.  Visual  examination  of  all 
other  cognitive  variables  for  each  of  the  11  cases  indicated 
that,  except  for  one  subject,  poor  test  performance  was 
typical.  The  one  subject  with  average  scores  on  all  other 
cognitive  variables  was  assigned  a z score  equal  to  the  mode 
of  his  other  FTGSTP  subscores  on  the  assumption  that  his  00 
represented  a missing  observation.  A z score  of  -3  was 
assigned  to  the  00s  of  the  other  cases  since  that  is  a com- 
monly used  score  for  denoting  the  lowest  point  on  a continuum 
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of  scores  in  z conversion.  This  is  based  on  the  fact  that 
99.7%  of  all  scores  in  a normal  distribution  lie  above  a z 
score  of  -3. 

EEL  588  converts  percentiles  to  z scores  based  on  the 
normal  distribution.  The  converted  FTGSTP  scores  were  re- 
turned to  the  cards  containing  the  other  variables  and  all 
were  converted  to  z scores  via  EEL  587,  Transformation  to  z 
Scores  from  Raw  Data.  Thus,  all  variables  were  based  on  a 
comparable  scale. 

A weighting  system  was  devised  to  compensate  for  the 
fact  that  all  HISM  variables  consisted  of  individual  items 
while  all  other  predictor  variables  were  scores  represent- 
ing aggregates  of  items.  An  arbitrary  decision  was  made  to 
allocate  the  weights  so  that  cognitive  and  affective  vari- 
ables each  contributed  one  half  of  the  total  weight.  It  was 
found  that  a weighting  function  of  350  would  allow  the 
desired  breakdown.  Therefore,  the  variables,  after  z score 
transformation,  were  multiplied  by  the  weights  shown  in 
Table  1. 

In  order  to  compensate  for  shrinkage  due  to  error  in 
sampling,  it  was  decided  to  develop  predictive  equations 
with  one  subsample  of  each  sex  (hereafter  referred  to  as 
subsample  one) , then  cross-validate  by  using  those  equa- 
tions to  predict  for  each  sex  with  the  remaining  subsample 
(subsample  two) . 

The  profile  analysis  computer  package  is  limited  to 
100  subjects,  and,  since  that  number  constituted  over  12% 
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TABLE  1 

Weighting  of  Variables 


Variable 

Weight  Units 

Total 

Affective 

variables 

HISM  items 

1 

42 

42 

AVL  subscores 

19 

6 

114 

SRI 

19 

1 

19 

Affective  sum 

175 

Cognitive 

variables 

SCAT  V 

25 

1 

25 

SCAT  Q 

25 

1 

25 

FRGSTP 

25 

5 

125 

Cognitive  sum 

175 

of  the  male  subjects  and  over  15%  of  the  female  subjects, 
it  seemed  appropriate  to  limit  each  subsample  to  100  cases. 
Since  two  or  more  congruent  profiles  must  be  available  in  a 
sample  in  order  for  a pattern  to  emerge,  it  was  decided  to 
utilize  a rectangular  sampling  technique  in  an  attempt  to 
bring  out  the  maximum  number  of  patterns.  This  was  accom- 
plished by  first  stratifying  each  sex  subgroup  of  the  com- 
plete sample  according  to  total  scores  on  the  School  and 
College  Ability  Test.  Two  subsamples  of  100  cases  were 
drawn  from  each  sex  group  by  drawing  the  first  and  second 


24 


cases  of  each  group,  then  every  twelfth  and  thirteenth  case 
thereafter  for  females  and  every  sixteenth  and  seven- 
teenth case  for  the  males.  Alternate  cases  were  assigned 
to  each  subsample  for  each  sex. 

Phase  Two 

In  order  to  produce  the  shape  variables,  one  subsample 
for  each  sex  was  factor  analyzed  by  EEL  504.  Distance  vari- 
ables were  obtained  with  EEL  510.  The  principal  axes  factor 
matrices  produced  in  the  analyses  were  rotated  to  the  vari- 
max  criterion  via  EEL  511.  Loadings  of  the  subjects  on  the 
rotated  factors  constituted  the  variables,  as  both  proce- 
dures produced  groupings  of  cases  rather  than  of  character- 
istics (test  scores,  items). 

Type  variables  were  produced  for  each  sex  by  processing 
the  first  subsamples  with  EEL  777.  In  addition  to  punching 
the  d's  showing  the  relationship  of  each  individual  to  each 
pattern  the  program  punched  means  and  weights  for  each 
pattern.  The  d's  became  type  variables,  while  the  pattern 
means  and  weights  were  used  in  cross-validation. 

Variables  representing  a conventional  R analysis 
approach  were  produced  by  EEL  518,  Factor  Scores.  The  fac- 
tor matrix  and  the  r matrix  required  for  the  procedure  were 
obtained  through  EEL  501,  the  basic  factor  analysis  program 
of  the  Educational  Evaluation  Library. 

All  four  species  of  variables  were  examined  with  step- 
wise multiple  regression  analysis  to  determine  the  best 
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combination  and  weighting  of  variables  (within  a species) 
for  predicting  each  of  the  three  criteria.  Analysis  using 
BMD02R  (Dixon,  1967)  was  performed  separately  for  each  sex, 
resulting  in  24  separate  equations. 

Phase  Three 

The  problem  of  error  in  sampling  is  particularly 
critical  when  judgments  are  to  be  made  concerning  the  rela- 
tive predictive  efficacy  of  several  regression  equations. 

The  very  nature  of  the  multiple  regression  technique  would 
lead  one  to  expect  a decrease  in  accuracy  of  prediction  when 
a regression  formula  is  applied  to  a group  other  than  the 
one  from  which  its  Beta-weights  are  obtained.  This  phenome- 
non, shrinkage,  occurs  when  the  operation  of  chance  arrange- 
ments in  sampling  error  in  the  predictor  variables  results 
in  a more  or  less  idiosyncratic  set  of  Beta-weights.  Thus, 
cross-validation  becomes  a vital  part  of  any  predictive 
study. 

Several  methods  of  compensating  for  the  effects  of 
shrinkage  have  been  proposed  (Mouly,  1970;  Travers,  1969). 

In  the  present  study  it  was  decided  to  cross-validate  the 
24  regression  equations  produced  in  phase  two  by  applying 
them  to  similar  variables  derived  from  subsample  two. 

Type  variables  from  the  second  subsample  were  produced 
by  comparing  each  of  those  subjects  with  each  pattern  that 
had  been  derived  from  the  first  subsample  with  EEL  777.  A 
program  was  written  utilizing  the  original  pattern  means  and 
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weights  to  compute  dissimilarity  measures  ( NEd2)  in  a 
manner  identical  to  the  one  used  for  that  purpose  in  the 
distance  module  of  EEL  777  (see  p.  11) . 

Cross-validation  of  the  equations  produced  through 
analysis  of  the  variables  which  actually  were  factor  load- 
ings required  that  a system  be  devised  whereby  factor  load- 
ings of  subjects  in  the  second  subsamples  would  be  estimated 
on  factors  derived  from  the  scores  of  subjects  in  the  first 
subsamples.  Since  both  the  shape  and  the  distance  analyses 
were  transpose  procedures,  the  estimation  of  factor  load- 
ings for  the  subjects  was  basically  the  same  problem  as 
estimating  factor  loadings  of  tests  not  included  in  the 
original  composition  of  a conventional  factor  analysis. 
Simply  stated,  the  problem  was  to  devise  a method  of  extend- 
ing the  factor  matrices. 

Mosier  (1938)  clarified  and  expanded  previous  methods 
of  extending  a factor  matrix  for  use  mainly  with  results  of 
centroid  or  principal  axes  analysis.  His  equation  (sub- 
stituting current  notation  of  matrix  orientation  for  that 
used  by  Mosier)  is  presented  below. 

V = R p G" 1 T 

L2K  L2Li  LiK  KK  KK 

where : 

Li  is  the  number  of  original  variables, 

L2  is  the  number  of  additional  variables, 

K is  the  number  of  factors. 


27 


represents  a matrix  of  estimated  factor  loadings  from 

a rotated  solution.  RT  _ is  a matrix  formed  by  cross 

L2  L i 

correlation  of  the  additional  variables  with  those  in  the 
original  analysis.  F is  the  original  unrotated  factor 

1 I\ 

matrix.  G“*  is  the  inverse  of  the  matrix  product  F'  FT  T,. 

KLi  LiK 

The  transformation  matrix  T was  produced  in  an  earlier 
step  and  served  to  translate  estimates  on  the  unrotated 
factor  matrix  to  estimates  on  the  rotated  matrix. 

Taking  advantage  of  modern  computer  techniques,  Mosier's 
equation  was  simplified  by  using  the  factor  matrix  result- 
ing from  the  varimax  solution  rather  than  that  from  the 
principal  axes  solution.  Thus,  the  transformation  matrix 
was  eliminated  from  computation  and  the  equation  became: 

VL2K  = RL2L!  VLiK  gkk* 

In  the  above  equation  V is  the  factor  matrix  produced  by 

Li  1 x\ 

the  varimax  rotation  of  F ; G“*  is  the  inverse  of  the 

Li  1 i\  i\J\ 

matrix  product  V'  V_  _ _ . 

J\JLi  i L i K 

The  present  study  required  the  preparation  of  special 
computer  programs  to  produce  and  multiply  the  matrices  of 
the  revised  Mosier  formula.  In  order  to  extend  the  factor 
matrix  produced  by  EEL  510  (distance  factors)  an  adap- 
tation of  EEL  529,  Transpose  Cross  Correlation,  produced 
a matrix  of  distance  measures  comparing  each  subject  in 
one  subsample  with  each  subject  in  the  other  subsample. 

That  matrix  was  employed  for  the  R. 


matrix.  The 


28 


program,  a transpose  procedure,  first  computed  a \jzd2 
matrix,  then  converted  those  dissimilarity  measures  into  a 
matrix  of  similarity  measures  with  the  same  technique  by 
which  this  is  done  in  EEL  777. 

Conventional  variables  for  cross-validation  purposes 
were  produced  by  computing  factor  scores  for  subjects  in 
the  second  subsample  via  EEL  518.  The  factor  matrix  and 
r matrix  utilized  in  that  operation  were  those  derived  from 
processing  the  first  subsample  with  EEL  501. 

Phase  Four 

After  re-creation  of  all  variables  with  the  second 
subsample,  the  particular  equation  resulting  from  multiple 
regression  analysis  was  applied  to  the  appropriate  variables. 
Relationships  of  the  obtained  values  to  the  predicted  values 
of  the  dependent  variables  were  investigated  by  means  of 
product-moment  correlations. 


CHAPTER  3 


RESULTS 

The  major  question  here  was  not  that  of  determining 
which  of  the  four  methods  accounted  for  the  greatest  amount 
of  variance  of  the  original  data  matrix,  nor  that  of  deter- 
mining which  was  the  most  parsimonious  solution.  Rather, 
the  study  was  designed  to  compare  the  higher-order  variables 
(empirical  constructs)  produced  by  the  various  methods  as 
to  their  effectiveness  in  predicting  certain  selected  cri- 
teria. It  is  in  order,  however,  to  present  data  concerning 
those  variables  as  a backgrourld  for  the  information  derived 
from  multiple  regression  analysis.  Since  the  emphasis  was 
on  the  quantitative  rather  than  the  qualitative  aspects  of 
the  variables , no  attempt  has  been  made  to  describe  their 
actual  content. 

Results  of  the  procedures  detailed  in  Chapter  2 will 
be  presented  in  three  parts:  a quantitative  description 

of  the  results  (higher-order  variables)  of  the  factor 
analysis  procedures  and  the  profile  analysis  technique; 
results  of  the  multiple  regression  analysis;  results  of  the 
cross-validation  process. 

The  Variables 

A direct  comparison  of  the  methods  for  producing  the 
higher-order  variables , even  though  it  be  quantitative  in 
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nature,  cannot  be  done  with  complete  integrity.  Though  the 
original  data  matrix  was  the  same  for  all  four  methods, 
it  will  be  recalled  that  each  method  analyzed  the  matrix 
from  a different  perspective,  resulting  in  four  distinctly 
different  species  of  variables.  Therefore,  comparisons  pre- 
sented here  are  considered  to  be  of  the  higher-order  vari- 
ables rather  than  of  the  methods  by  which  they  were  produced. 

Information  concerning  the  results  of  the  factor 
analysis  procedures  is  presented  in  Tables  2 and  3. 

Conventional  R analysis  with  the  male  subgroup,  using 
EEL  501,  produced  39  principal  axes  which  accounted  for  73.51% 
of  the  total  variance.  Rotation  to  the  varimax  criterion 
resulted  in  a 16-factor  solution  accounting  for  60%  of  the 
total  variance.  That  figure  was  81.62%  of  the  cortmon 
variance . 

Analysis  with  EEL  501  for  the  females  also  produced  39 
principal  axes  which,  when  rotated  to  the  varimax  criterion, 
resulted  in  a 16-factor  solution.  The  principal  axes 
solution  accounted  for  73.51%  of  the  total  variance,  while 
the  16-factor  varimax  solution  accounted  for  60.08%  of  total 
variance,  that  being  81.98%  of  common  variance. 

As  shown  in  Table  3 , except  for  the  first  rotated 
factor,  the  variance  in  R analysis  was  distributed  rather 
consistently  through  the  16  factors  for  both  the  male  and 
the  female  subgroups . 


TABLE  2 

Comparisons  of  Variables  Produced  by  Three  Methods  of  Factor 
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TABLE  3 

Sums  of  Squared  Loadings  of  the  Rotated  Factors 


Factor 

number 

R analysis 

Q an 

alysis 

D analysis 

Male 

Female 

Male 

Female 

Male 

Female 

1 

6.08 

6.36 

26.26 

27.43 

37.18 

38.79 

2 

2.66 

2.71 

5.85 

9.12 

34.09 

36.49 

3 

2.26 

2.51 

8.04 

10.51 

4.35 

7.57 

4 

1.81 

1.73 

8.94 

4.78 

7.12 

2.57 

5 

1.83 

1.79 

7.07 

7.24 

3.17 

1.42 

6 

1.54 

1.44 

8.60 

3.16 

1.79 

7 

2.09 

1.83 

5.28 

10.01 

8 

1.94 

2.52 

4.22 

4.31 

9 

1.67 

1.69 

4.70 

5.85 

10 

1.69 

1.59 

4.56 

6.14 

11 

2.08 

1.54 

6.33 

4.24 

12 

1.48 

1.71 

4.69 

13 

1.35 

1.53 

14 

1.54 

1.83 

15 

1.74 

1.69 

16 

1.84 

1.18 

Distance  analysis  with  EEL  510  produced  36  principal 
axes  for  the  male  subgroup.  That  solution  accounted  for 
96.58%  of  the  total  variance.  Rotation  to  the  varimax  cri- 
terion resulted  in  six  factors  which  accounted  for  87.71% 
of  total  variance,  that  being  90.93%  of  all  common  variance. 

Similar  results  were  obtained  for  the  female  subgroup 
when  it  was  processed  with  EEL  510.  Again,  36  principal 
axes  were  produced,  accounting  for  96.07%  of  total  variance 
Rotation  to  the  varimax  criterion  resulted  in  only  five 
factors,  however.  That  solution  accounted  for  86.83%  of  the 
total  variance,  which  was  90.41%  of  all  common  variance. 
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Inspection  of  Table  3 shows  that  the  variance  in  the 
distance  analyses  was  concentrated  heavily  in  the  first  two 
rotated  factors  for  both  sexes. 

Q analysis  with  EEL  504  produced  14  principal  axes 
from  the  data  matrix  of  the  male  subgroup.  The  principal 
axis  solution  accounted  for  95.81%  of  total  variance,  and 
the  12-factor  varimax  solution  accounted  for  94.52%  of  the 
total  variance.  That  figure  was  98.66%  of  the  common 
variance . 

For  the  females,  Q analysis  with  EEL  504  also  resulted 
in  14  principal  axes,  accounting  for  95.94%  of  the  total 
variance.  Rotation  to  the  varimax  criterion  produced  an 
11-factor  solution  accounting  for  92.77%  of  the  total  vari- 
ance, which  was  96.70%  of  common  variance. 

For  both  males  and  females,  the  first  rotated  factor 
accounts  for  a large  portion  of  variance  (Table  3) . On  the 
whole,  however,  there  does  not  seem  to  be  an  obvious  pattern 
to  the  array  of  sums  of  squared  loadings.  This  is  expecially 
true  for  the  factors  derived  from  the  data  matrix  of  the 
female  subgroup,  and  is  in  contrast  to  the  observations  made 
of  the  loadings  in  R analysis  and  D analysis.  That  is  to 
say,  the  sums  of  the  squared  loadings  from  Q analysis  showed 
neither  the  relative  consistency  of  those  from  R analysis 
nor  the  rapidly  tapering  effect  (except  for  the  third  male 
factor)  of  D analysis. 

As  described  in  Chapter  2,  Guertin's  method  for  iso- 
lating patterns  makes  successive  use  of  shape  analysis  and 


34 


distance  analysis.  In  the  present  study,  the  shape  factors 
were  the  same  as  those  reported  as  Q analysis.  Distance 
analysis  performed  independently  on  each  of  the  12  shape 
factors  of  the  male  subgroup  produced  a total  of  25  patterns. 
The  same  process  performed  for  the  11  shape  factors  of  the 
females  produced  24  patterns.  Table  4 presents  the  number 
of  profiles  loading  on  each  shape  factor  and  the  resultant 
number  of  patterns  that  were  extracted.  Sums  of  the  squared 
loadings  for  each  of  the  shape  factors  are  shown  in  Table  3 
under  the  heading  Q analysis. 


TABLE  4 


Patterns  Derived  From  the  Shape  Families 


Shape 

factor 

number 


Number 
of  cases 


Number  of 
patterns 


Males 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 


21 

6 

11 

5 

3 

5 

4 
8 

17 

6 
4 
4 


3 

3 

3 
2 
1 
1 
2 
2 

4 
2 
1 
1 
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TABLE  4 — (continued) 

Shape 

factor 

Number 

Number  of 

number 

of  cases 

patterns 

Females 

1 

22 

4 

2 

5 

2 

3 

7 

2 

4 

5 

2 

5 

9 

3 

6 

4 

1 

7 

5 

2 

8 

20 

4 

9 

4 

2 

10 

5 

2 

11 

6 

2 

The  Equations 

A total  of  24  multiple  regression  equations  were  pro- 
duced in  the  present  study  as  an  attempt  to  determine  the 
best  combination  and  weighting  of  variables  within  each 
species  for  predicting  each  of  the  three  criterion  scores. 
Separate  equations  were  computed  for  each  sex.  An  F test 
was  performed  on  the  results  of  each  stepwise  regression 
analysis  to  determine  whether  the  multiple  R of  the  final 
step  was  significantly  greater  than  the  coefficient  produced 
in  the  first  step  of  that  procedure  (Guilford,  1965,  p.  403). 
Differences  were  significant  (at  least  p < .05)  for  8 of  the 

12  equations  for  the  male  subgroup,  but  none  of  the  differ- 
ences in  the  females'  equations  reached  that  level.  Appendix 
Tables  B,  C,  and  D show  the  order  of  selection  of  each  vari- 
able in  each  equation,  giving  the  multiple  correlation 
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coefficient  at  each  step.  F.  Ratios  for  the  difference  be- 
tween predictive  effectiveness  in  the  first  and  the  last 
steps  of  each  of  the  multiple  regression  procedures  are  pre- 
sented in  Appendix  Table  E. 

Regression  weights  for  the  variables  in  each  equation 
are  presented  in  Tables  5 and  6. 

Cross-Validation 

Using  the  weighted  scores  of  subjects  in  the  second 
subsamples,  each  species  of  variable  was  re-created  from  the 
data  obtained  by  analysis  of  the  first  subsamples.  Estimated 
loadings  on  factors  from  the  shape  analysis  and  the  distance 
analysis  were  produced  with  the  revised  Mosier  formula  de- 
scribed in  Chapter  2.  Factor  scores  were  obtained  with  the 
correlation  matrix  and  the  factor  matrix  from  the  earlier 
analysis  with  EEL  501,  and  profile  scores  were  prepared  to 
show  the  relationship  of  each  subject  to  each  pattern  de- 
rived from  the  first  subgroup. 

Multiplication  of  the  newly  created  variables  by  the 
regression  weights  presented  in  Tables  5 and  6 produced  a 
matrix  of  estimated  criterion  scores  (y) . When  these  were 
correlated  (Pearson  r)  with  the  obtained  criterion  scores 
(y)  they  produced  a matrix  of  coefficients  which  were  shrunken 
versions  of  the  original  multiple  correlation  coefficients. 
That  is,  the  over- inf lated  multiple  correlation  coefficients 
which  resulted  from  chance  arrangement  of  error  in  the  first 
samples  were  smaller  (as  expected)  in  the  second  equations 
due  to  a different  distribution  of  error  in  the  second 
subsamples.  Table  7 shows  the  original  multiple  correlation 


TABLE  5 


Regression  Weights  of  Variables  in  Each  Predictive  Equation 

(Males) 


Regression  weights 


Variable  First  semester  GPA  Hours  attempted  Total  GPA 


Types 


stant 

1.26925) 

2.59226) 

0.83611) 

1. 

-0.05131 

-0.12597 

-0.10960 

2. 

-0.03321 

0.14325 

0.05123 

3. 

0.04134 

-0.04113 

0.06664 

4. 

0.01587 

-0.01489 

0.02753 

5. 

0.01365 

0.01404 

0.01137 

6. 

0.02819 

0.00944 

0.00524 

7. 

-0.21650 

-0.16985 

-0.13885 

8. 

-0.10737 

-0.15650 

-0.07568 

9. 

0.11180 

0.10227 

-0.01931 

10. 

0.03562 

0.04843 

0.04137 

11. 

-0.04788 

-0.02971 

-0.07611 

12. 

-0.00882 

0.01646 

0.01632 

13. 

0.05560 

-0.00692 

0.02864 

14. 

-0.00524 

0.00799 

0.00290 

15. 

-0.01972 

-0.02444 

-0.02791 

16. 

* * * 

-0.13642 

0.03696 

17. 

0.04545 

0.08249 

0.01955 

18. 

0.02635 

-0.08359 

0.02275 

19. 

-0.00922 

0.00811 

-0.00589 

20. 

-0.07256 

*** 

-0.09246 

21. 

0.01498 

0.05194 

0.04179 

22. 

0.01881 

0.05780 

0.07102 

23. 

-0.04048 

-0.01106 

-0.06045 

24. 

-0.01449 

-0.03117 

-0.05553 

25. 

0.16528 

-0.15253 

0.15923 

Shape 

, stant 

0.05833) 

0.06653) 

0.0686) 

1. 

1.04668 

0.97954 

0.94089 

2. 

0.66930 

0.35639 

0.39441 

3. 

* * * 

0.14312 

0.19766 

4. 

-0.44898 

-0.69873 

-0.90220 

5. 

0.35001 

0.48683 

0.47318 

6. 

0.11886 

-0.25732 

-0.11474 

7. 

0.09196 

0.31296 

-0.94049 

8. 

-0.63591 

0.06589 

0.19001 

9. 

-1.43616 

-0.89838 

-1.06961 

10. 

0.38132 

0.46380 

0.35215 

11. 

-1.37395 

-1.50211 

-1.04249 

12. 

0.22362 

* * * 

0.45622 

TABLE  5— (continued) 


Regression  weights 

Variable  First  semester 

GPA 

Hours  attempted 

Total  GPA 

Distance 

(constant 

-0.01961) 

-4.16216) 

0.41203) 

1. 

-0.87110 

2.86827 

-1.22607 

2. 

0.24442 

4.38697 

0.81785 

3. 

-0.29315 

0.54926 

-0.61605 

4. 

0.12172 

1.25319 

0.29904 

5. 

-1.14730 

-1.56654 

-1.24383 

6. 

1.59680 

2.04708 

1.04065 

Factor 

scores 

(constant 

0.04439) 

0.04444) 

0.03692) 

1. 

-0.03008 

-0.04862 

-0.03388 

2. 

-0.09543 

-0.11123 

-0.09381 

3. 

0.01128 

0.00910 

-0.00992 

4. 

-0.02544 

-0.06395 

-0.04848 

5. 

-0.07125 

-0.10658 

-0.09891 

6. 

0.04098 

0.06528 

0.03874 

7. 

-0.01596 

-0.05038 

-0.02495 

8. 

0.01172 

* * * 

★ ** 

9. 

-0.09158 

-1.19848 

-0.09339 

10. 

0.09086 

0.06520 

0.08978 

11. 

-0.10254 

-0.01494 

-0.03816 

12. 

0.03703 

-0.01531 

-0.03573 

13. 

0.04792 

0.04851 

0.02201 

14. 

-0.01620 

* * * 

0.01069 

15. 

0.01683 

0.02935 

0.01833 

16. 

-0.13128 

-0.16753 

-0.13572 

. * ** 


Not  in  equation 


TABLE  6 


Regression  Weights  of  Variables  in  Each  Predictive  Equation 

(Females) 


Regression  weights 


Variable 

First  semester 

GPA 

Hours  attempted 

Total  GPA 

Types 

(constant 

-2.29134) 

-0.22278) 

-1.40351) 

1. 

-0.18528 

-0.30433 

-0.11212 

2. 

-0.02945 

0.01418 

* * * 

3. 

-0.08492 

-0.02613 

-0.07107 

4. 

0.26581 

0.33105 

0.19680 

5. 

-0.16533 

-0.18705 

-0.07887 

6. 

0.11228 

0.10141 

0.03636 

7. 

0.14001 

0.18963 

0.11560 

8. 

-0.03765 

-0.07957 

-0.02030 

9. 

0.06770 

0.15396 

0.06151 

10. 

-0.08262 

-0.12170 

-0.09908 

11. 

-0.10737 

-0.09678 

-0.08173 

12. 

-0.00907 

-0.04372 

* * * 

13. 

-0.03008 

-0.04517 

0.03305 

14. 

0.02403 

-0.06020 

0.02660 

15. 

-0.06616 

-0.02953 

-0.03088 

16. 

-0.01647 

* * * 

-0.02109 

17. 

0.19181 

0.07376 

0.11941 

18. 

0.07431 

0.11197 

0.05986 

19. 

-0.14383 

-0.17615 

-0.11063 

20. 

0.14141 

0.13147 

0.09464 

21. 

-0.36279 

-0.28793 

-0.32667 

22. 

-23 

0.11546 
n i a 3i  a 

0.21011 

0.10433 

0.10233 

0.10259 

24. 

0.12620 

0.16848 

0.08850 

25. 

-0.09311 

-0.11751 

-0.06037 

26. 

-0.00742 

Shape 

-0.01611 

-0.01461 

(constant 

-0.06627) 

-0.12858) 

-0.03804) 

1. 

0.71609 

0.23141 

0.79429 

2. 

0.07850 

0.31197 

0.10281 

3. 

* * * 

-0.12855 

-0.22820 

4. 

-0.08635 

0.52645 

-0.28692 

5. 

-0.39435 

-0.82316 

-0.20392 

6. 

-0.47445 

0.18961 

-0.56834 

7. 

-0.27475 

-0.17000 

-0.44393 

8. 

-0.75915 

-0.39158 

-0.54663 

9. 

0.10368 

-0.21323 

0.08825 

10. 

0.35895 

0.38826 

0.22583 

11. 

-0.22978 

* * * 

-0.50885 

TABLE  6 — (continued) 


Variable 

Regression  weights 

First  semester 

GPA 

Hours  attempted 

Total  GPA 

Distance 

(constant 

1.22765) 

-0.97311) 

1.17970) 

1. 

-2.23866 

-0.22484 

-2.24730 

2. 

0.22904 

1.29666 

0.28067 

3. 

-0.63574 

0.71228 

-0.43447 

4. 

0.38871 

0.96018 

0.67910 

5. 

-0.19736 

-0.42004 

*** 

Factor 

scores 

(constant 

-0.07795) 

-0.05806) 

-0.06726) 

1. 

0.03056 

0.05444 

0.05392 

2. 

0.04937 

-0.03005 

-0.03419 

3. 

-0.00550 

-0.01382 

-0.02162 

4. 

* * * 

-0.04772 

-0.01770 

5. 

-0.06985 

-0.02561 

-0.03729 

6. 

0.05213 

0.02652 

0.02046 

7. 

0.04835 

0.08225 

0.06755 

8. 

-0.10757 

-0.04965 

-0.02002 

9. 

-0.01670 

-0.02491 

-0.02849 

10. 

0.01382 

-0.03549 

-0.06368 

11. 

-0.03948 

0.16050 

0.11845 

12. 

0.04729 

0.04215 

0.04958 

13. 

*★* 

0.08665 

0.07772 

14. 

*** 

-0.01815 

0.02049 

15. 

-0.00601 

-0.04609 

-0.05273 

16. 

-0.01841 

-0.02859 

-0.00658 

*** 


Not  in  equation 


TABLE  7 

Relationships  of  the  Variables  to  the  Criteria 
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* significantly  different  from  zero  at  p < .05 

* significantly  different  from  zero  at  p < .01 

Note:  Significance  of  Mult.  R is  based  on  the  number  of  variables  in  the  equation. 

Significance  of  r ^ is  based  on  zero  order  r (Guilford,  1965,  pp.  580-581) . 
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coefficient  and  the  correlation  between  estimated  criteria 
and  obtained  criteria  (r  a)  . 

y*y 

Tests  of  Significance 

In  order  to  determine  whether  significant  differences 
existed  among  the  results  from  the  four  variable  species, 
the  cross-validation  r's  were  transformed  to  their  corre- 
sponding Fisher's  z coefficients  (Guilford,  1965,  p.  589), 
and  a two-way  analysis  of  variance  was  applied.  The  z 
coefficients  were  coded  to  facilitate  ease  in  computation. 

An  F ratio  significant  at  p < .01  (F  = 6.84;  df  3,  17)  re- 
sulted from  differences  between  methods  (species  of  variables) . 
Tukey's  w procedure  (Steel  and  Torrie,  1960)  was  then  utilized 
to  determine  which  method (s)  significantly  differed  from  others. 

In  the  Tukey  procedure  the  variable  species  were  placed 
in  order  of  the  magnitude  of  their  average  effectiveness  (the 
mean  of  the  coded  z coefficients  for  criteria) . This  was 
done  for  both  sexes  combined,  then  separately  for  each  sex. 

In  all  three  situations  the  ranking  was  the  same,  going  from 
least  effective  to  most  effective  in  the  following  order: 
shape,  distance,  type,  and  conventional  variables. 

When  data  for  both  sexes  were  combined,  the  mean  of  the 
coded  z coefficients  for  shape  variables  differed  signifi- 
cantly at  p < .05  from  the  mean  of  the  distance  variables  and 
at  p < .01  from  type  variables  and  conventional  variables. 

The  same  results  were  found  when  the  data  for  females  were 
examined  alone.  However,  examination  of  the  data  for  males 
revealed  no  differences  which  were  significant  at  the  afore- 


mentioned levels. 


CHAPTER  4 


DISCUSSION 

The  exploratory  nature  of  the  present  study  resulted 
in  a variety  of  findings  that,  though  valuable  and  inter- 
esting, were  only  of  tangential  consequence  to  the  research 
questions  set  forth  in  Chapter  1.  Having  presented  these 
data  in  the  preceding  chapters,  however,  it  is  appropriate 
to  discuss  the  auxiliary  aspects  as  well  as  those  directly 
connected  with  the  original  questions. 

Though  the  inclusion  of  variables  beyond  the  one  that 
entered  in  the  first  step  increased  prediction  significantly 
in  only  8 of  the  12  regressions  equations  for  males,  and  in 
none  of  those  for  females,  the  cross-validation  procedure 
utilized  the  equation  that  resulted  from  the  maximum  number 
of  steps.  As  pointed  out  in  Chapter  2,  the  multiple  regres- 
sion procedure  takes  advantage  of  and  maximizes  everything 
that  helps  predict  the  dependent  variable,  even  sampling 
error.  The  shrinkage  in  the  correlation  coefficients  in 
cross-validation  (see  Table  7)  indicates  that,  especially  for 
the  famale  subsamples,  there  was  a considerable  amount  of 
correlation  of  error  variance  in  addition  to  the  systematic 
variance  associated  with  individual  differences  related  to 
the  criterion  measures.  Perhaps  in  the  present  study  a more 
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parsimonious  solution  for  certain  regression  equations  might 
have  been  a better  choice.  Inclusion  of  all  variables  in 
each  equation  allowed  for  maximum  contamination  with  error, 
which  would  naturally  have  increased  the  percentage  of 
shrinkage. 

Attempts  to  predict  the  number  of  credit  hours  attempted 
over  a two-year  period  met  with  mixed  success  for  both  sexes. 
Inspection  of  Table  7 shows  that  fewer  of  the  original 
equations  were  significant  at  the  .01  level  of  confidence 
(one  out  of  four)  for  the  females  than  for  males  when  hours 
attempted  was  the  criterion  variable.  The  situation  reversed 
itself  on  cross-validation,  however,  as  three  out  of  the 
four  cross-validation  r's  of  the  female  subgroup  were  signifi- 
cant at  p < .01  but  none  of  those  for  the  male  subgroup 

fared  as  well.  Two  of  the  males'  cross-validation  r's  were 
significantly  different  from  zero  at  p<  ‘.05,  but  the  other 
two  were  not.  Considering  the  lack  of  reliability  of  that 
criterion  measure  the  results  were  not  unexpected.  As  a 
variable,  hours  attempted  typically  is  of  such  a heterogeneous 
nature  as  to  be  of  limited  usefulness  for  prediction  purposes. 
The  exploratory  spirit  of  the  present  work  allowed  inclu- 
sion of  that  variable  as  one  of  the  criterion  measures  in 
order  to  examine  its  relationship  to  the  atypical  species  of 
variables  produced  in  the  study.  The  results  serve  as  a 
reminder  that  a high  proportion  of  error  variance  is  no  less 
a sin  for  dependent  variables  than  it  is  for  independent 


45 


variables.  From  that  standpoint  it  is  not  possible  to  draw 
firm  conclusions  concerning  the  variables  themselves  insofar 
as  they  relate  to  hours  attempted. 

Turning  now  to  consideration  of  the  three  questions 
around  which  the  study  was  designed,  it  is  necessary  to 
review  certain  criticisms  of  the  various  pattern  analytic 
techniques  described  in  the  review  of  the  literature.  Shape 
variables,  it  will  be  recalled,  were  considered  inadequate 
because  they  do  not  consider  level  and  variance.  Distance 
measures  may  present  spuriously  low  d squares  when  two 
profiles  of  opposite  shapes  are  at  the  same  level  and  the 
variance  is  small.  Guertin's  method  of  profile  analysis  was 
designed  specifically  to  account  for  the  major  shortcomings 
of  the  other  two  methods. 

It  must  also  be  remembered  that  it  was  proposed  (Spence, 
1960)  that  greater  levels  of  understanding  of  phenomema  can 
be  attained  through  use  of  variables  of  a higher  level  of 
abstraction.  Therefore,  profile  analytic  procedures  would 
seem  to  be  superior  to  individual  test  scores  when  they  are 
used  as  independent  variables. 

Using  the  rationale  presented  in  the  two  preceding 
paragraphs,  it  was  speculated  that  the  four  species  of  vari- 
ables would  exhibit  their  effectiveness  in  a particular 
order  for  these  academic  criteria.  Shape  variables  were 
anticipated  to  predict  with  the  least  efficiency,  as  they 
represented  groupings  based  only  on  the  ipsativity  of  the 
members'  first-order  variables. 
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Distance  variables,  it  was  thought,  would  prove  to  be  better 
predictors  than  shape  variables,  but  not  as  efficient  as  the 
remaining  two  species.  It  was  speculated  that  the  factor 
scores  would  predict  less  well  than  the  d's  related  to 
Guertin's  method  as  factor  scores  involve  both  distance  and 
variability.  The  latter  technique  should  have  provided  the 
best  predictor  variables  if  the  rationale  were  correct,  as 
the  patterns  take  into  consideration  shape,  distance,  and 
variability.  Additionally,  they  constitute  constructs  which 
are  at  a relatively  high  level  of  abstraction. 

For  purposes  of  examining  the  data  relative  to  the 
specific  questions  posed  for  this  study,  attention  was  given 
to  the  cross-validation  r's.  Comparisons  were  based  on  the 
results  of  Tukey's  w procedure. 

As  anticipated,  the  shape  variables  proved  to  be  the 
poorest  predictors.  They  were  the  only  variables  to  show 
themselves  as  significantly  different  from  the  others  in 
predictive  effectiveness.  Observing  the  data  in  Table  7, 
it  is  interesting  to  note  that  shape  variables  did  not 
reach  predictive  effectiveness  at  p < .05  for  any  of  the 
three  criteria  in  the  female  subgroup,  though  they  reached 
that  level  with  two  of  the  criteria  of  the  male  subgroup. 

Distance  variables  had  been  expected  to  rank  just  above 
shape  variables  in  effectiveness.  This  was  found  to  be  true 
in  all  comparisons  made,  but  the  differences  were  not  sig- 
nificant. It  should  be  noted,  however,  that  differences  be- 
tween shape  variables  and  distance  variables  were  significant 
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at  p < .05  for  the  combined  sexes  and  for  females,  while 

differences  between  shape  and  both  type  and  conventional 
variables  were  significant  at  p < .01  for  the  aforementioned 

groups.  No  differences  of  significance  were  found  between 
any  variable  species  for  the  male  subgroup. 

It  had  been  anticipated  that  type  variables  would  be 
superior  to  conventional  variables  derived  from  factor 
scores  in  predicting  all  criteria.  Tukey's  procedure  showed 
no  differences.  However,  it  should  be  noted  that,  for  the 
female  subgroup,  the  type  and  factor  score  means  that  were 
tested  were  almost  equal. 

In  the  frame  of  reference  of  the  Tukey  procedure  applied 
to  the  data  from  Table  7,  the  three  problem  questions  (see 
Chapter  1)  will  be  considered  in  order. 

The  first  question,  querying  whether  profile  analysis 
can  be  used  to  provide  input  variables  for  multiple  regres- 
sion analysis  of  measures  of  academic  success,  is  answered 
in  the  affirmative.  There  remains  the  problem  of  reliability 
of  criterion  variables,  as  exemplified  by  the  relatively 
poor  performance  of  all  variable  species  when  predicting 
hours  attempted.  As  stated  earlier,  that  particular  problem 
may  be  a function  of  the  criterion  variable,  and  leaves 
inconclusive  results  concerning  the  relation  of  the  variable 
species  to  that  particular  criterion. 

Question  two  addressed  itself  to  identifying  the  most 
effective  variable  produced  by  profile  analysis.  On  the 


48 


basis  of  comparisons  made  with  the  Tukey  procedure  it  is 
suggested  that,  of  the  variables  generated  by  a method  of 
profile  analysis,  those  produced  by  Guertin's  method  are  the 
most  effective  predictors  when  used  in  the  multiple  regres- 
sion model.  However,  the  differences  between  those  vari- 
ables and  distance  variables  were  not  significant. 

Problem  question  three,  concerning  the  relative  effec- 
tiveness of  variables  produced  via  profile  analysis  and 
those  composed  of  test  items  and  scores , cannot  be  answered 
conclusively.  Differences  were  not  found  to  be  significant, 
but  it  should  be  remembered  that  means  of  type  variables  and 
of  conventional  variables  for  the  female  subgroup  were  found 
to  be  almost  equal  when  observed  during  the  Tukey  procedure. 

An  additional  point  to  consider  is  that  the  amount  of 
variance  accounted  for  by  the  various  methods  of  factor 
analysis  (Table  2,  p.  31)  did  not  show  a direct  relationship 
to  predictive  effectiveness  of  the  variables  produced  from 
those  analyses.  Though  shape  analysis  accounted  for  a 
larger  amount  of  variance  than  did  the  other  methods , the 
variables  it  produced  showed  themselves  to  be  significantly 
less  effective  than  those  produced  by  the  other  methods. 
Variance  accounted  for  by  distance  analysis  ranked  second 
in  magnitude  to  that  accounted  for  by  shape.  However,  the 
distance  variables,  though  not  significantly  different 
according  to  the  Tukey  procedure,  had  a lower  mean  corre- 
lation than  did  type  variables  or  conventional  variables. 
This  was  true  for  both  sexes . 
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The  inverse  relationship  between  amount  of  variance 
accounted  for  by  the  analysis  method  and  the  predictive 
effectiveness  of  its  variables  can  be  interpreted  as  evi- 
dence that  more  than  one  dimension  must  be  considered  when 
constructing  modal  patterns  related  to  academic  achievement. 
Neither  shape  nor  distance  information  is  sufficient  unto 
itself  for  that  purpose;  both  kinds  of  information  are 
valuable.  It  must  be  emphasized,  however,  that  this  impli- 
cation is  valid  when  academic  success  is  the  criterion  mea- 
sure, and  that  generalizations  to  other  criteria  are  not 
indicated. 


CHAPTER  5 


SUMMARY  AND  CONCLUSIONS 

The  work  described  here  represents  an  effort  to  produce 
empirical  constructs  to  account  for  systematic  variance  in 
measures  of  academic  success.  Specifically,  predictive 
effectiveness  of  variable  species  produced  from  the  same 
data  set  through  different  pattern  analytic  procedures  were 
compared  with  each  other  and  with  factor  scores,  also 
derived  from  that  data  set.  A previous  study  (Clarke  and 
Ammons,  1970)  furnished  data  on  1561  college  freshmen  for 
reexamination . 

Pattern  analytic  procedures  utilized  in  the  study  were 
shape  analysis,  distance  analysis,  and  Guertin's  method  of 
profile  analysis,  actually  a successive  combination  of  the 
shape  and  distance  techniques.  Conventional  variables  were 
represented  by  factor  scores.  Criterion  measures  included 
college  first  semester  grade  point  average  (GPA) , total 
credit  hours  attempted  over  a period  of  two  calendar  years, 
and  cumulative  GPA  over  two  calendar  years. 

All  data  were  converted  to  z scores.  The  independent 
variables  were  then  weighted  according  to  an  arbitrary 
scheme  designed  to  give  equal  weight  to  the  cognitive  and 
the  affective  components  of  the  data. 
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Shape  variables  consisted  of  loadings  on  rotated  factors 
produced  with  EEL  504,  and  distance  variables  were  loadings 
on  the  rotated  factors  resulting  from  analysis  of  the  data 
with  EEL  510.  Type  variables  were  produced  with  Guertin's 
Profile  Analysis  Package  (EEL  777)  and  consisted  of  d's 
showing  the  relationship  of  each  individual  to  each  pattern. 
Conventional  variables  were  factor  scores  produced  with 
EEL  518  from  factors  derived  via  EEL  501.  The  procedures 
were  performed  separately  for  each  sex. 

Multiple  regression  analysis  was  performed  separately 
with  all  variable  species  in  an  attempt  to  determine  the 
best  combination  and  weighting  for  predicting  each  of  the 
three  criterion  measures. 

Cross-validation  of  the  equations  using  shape  variables 
and  distance  variables  required  the  development  of  special 
programs  to  estimate  loadings  of  cases  in  the  second  sub- 
sample on  factors  derived  from  data  on  subjects  in  the  first 
subsample.  A revised  version  of  a formula  by  Mosier  (1938) 
was  used  to  extend  the  factor  matrices  in  order  to  re-create 
the  shape  and  the  distance  variables. 

Cross-validation  of  equations  with  type  variables  and 
with  conventional  variables  presented  a less  arduous  task, 
as  the  techniques  for  re-creating  those  variables  were 
comparatively  simple.  Type  variables  were  re-created  with 
a special  program  which  compared  the  scores  of  cases  in 
subsample  two  with  pattern  means  and  weights  derived  from 
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subsample  one.  Conventional  variables  were  simply  factors 
scores  of  subsample  two  produced  with  the  correlation  and 
factor  matrices  derived  via  EEL  501  with  subsample  one. 

Following  their  re-creation  the  appropriate  variables 
were  placed  in  the  prediction  formulae  derived  from  sub- 
sample one.  Pearson  product-moment  correlation  coefficients 
were  obtained  between  the  actual  criterion  measures  for 

each  case  and  the  predicted  criterion  measures  (r  £) . 

Y • Y 

Judgments  concerning  the  order  of  effectiveness  of  the 
variable  species  were  based  on  r a,  the  cross-validation 
correlation  coefficient. 

Inspection  of  the  cross-validation  r's  indicates  that 
profile  analytic  procedures  can  indeed  be  used  to  provide 
continuous  variables  as  input  for  multiple  regression 
analysis.  Naturally,  the  usual  precautions  concerning 
reliability  of  data  must  be  considered. 

In  comparing  the  predictive  effectiveness  of  the  four 
variable  species  it  became  evident  that  those  variables 
providing  only  one  kind  of  information  were  the  poorest 
predictors.  Those  that  took  only  shape  into  account  fared 
least  well,  followed  by  those  considering  distance.  Tukey's 
w procedure  showed  shape  variables  to  be  the  only  ones  sig- 
nificantly different,  however. 

In  looking  at  the  results  of  the  study  from  a practical, 


real-world  approach,  there  are  several  aspects  that  should 
be  considered.  One  of  the  basic  needs  that  triggered  the 
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present  research  was  the  necessity  for  finding  better  tools 
for  educational  and  vocational  counseling.  While  type 
variables  did  not  show  themselves  to  be  superior  to  con- 
ventional variables  in  prediction,  the  differences  were  not 
significant.  Considering  the  wealth  of  information  avail- 
able through  qualitative  study  of  groups  isolated  by  EEL  777 
(Chang,  1971) , the  predictor  use  of  d measures  based  on  the 
types  makes  Guertin's  method  a particularly  valuable  tool. 
However,  it  must  be  assumed  that  not  all  criterion  measures 
will  yield  themselves  equally  well  for  prediction  with  d's 
based  on  systematic  groupings  of  individual  differences. 
Further  investigation  should  be  undertaken  to  determine  what 
kinds  of  first-order  variables  are  appropriate  for  inclusion 
when  the  object  of  grouping  is  to  provide  predictive  cate- 
gories . 

Certain  specific  problems  that  are  current  in  education 
would  seem  to  lend  themselves  readily  to  investigation 
through  qualitative  and  quantitative  use  of  Guertin's  pro- 
file analysis.  As  an  example,  in  the  area  of  special 
education  much  effort  has  been  made  to  distinguish  between 
various  reasons  for  lack  of  success  in  school.  A broad 
spectrum  battery  of  instruments  measuring  physical, 
cognitive,  and  social  dimensions,  administered  to  several 
rectangular  samples  of  preschool  and  early  elementary 
children,  then  processed  with  EEL  777,  would  yield  groups 
for  qualitative  study.  Quantitative  processing  of  d's 
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derived  from  those  groups  in  the  same  manner  as  was  done  in 
the  present  research  might  well  be  a way  to  isolate  modal 
patterns  for  use  by  educators  in  the  specialized  areas. 

In  comparison  to  the  other  variable  species  in  the 
study,  re-creation  of  variables  in  other  subsamples  was 
substantially  easier  for  type  variables.  They  are  more 
readily  obtained  than  factor  scores,  can  be  produced  for 
large  samples,  and  require  only  a small  computer.  Process- 
ing the  first  subsamples  with  EEL  777  is  quite  expensive 
as  pointed  out  earlier,  and  does  require  a computer  with 
large  storage  capacity.  Once  the  patterns  are  identified, 
however,  succeeding  samples  can  be  compared  to  them  at  very 
little  expense.  Considering  the  ease  with  which  these  type 
variables  can  be  obtained,  and  considering  that  the  present 
study  does  not  indicate  that  they  are  ineffective,  further 
investigation  should  be  undertaken  to  refine  their  use  as 
continuous  variables.  The  usual  cautions  concerning 
judicious  use  of  experimental  variables  should  be  followed 


as  a matter  of  course. 


APPENDIX 


TABLES 


TABLE  A j 

Order  of  Selection  of  Variables  When  Predicting 

First  Semester  GPA 
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significantly  at  .01  level, 
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TABLE  E 

F Ratios  for  Difference  in  Predictive  Effectiveness 
Between  Correlations  at  the  First  and  the 
Last  Steps  of  Multiple  Regression  Analysis 
for  All  Equations 


Variable 

species 

First  semester 
GPA 

Hours 

attempted 

Total 

GPA 

Males 

Type 

2.54** 

1.26 

2.22** 

Shape 

2.24* 

2.41* 

2.51* 

Distance 

1.53 

4.31** 

1.07 

Factor  scores 

1.57 

3.17** 

1.86* 

Females 

Type 

.82 

1.17 

.71 

Shape 

.88 

.83 

.93 

Distance 

.54 

.94 

.73 

Factor  scores 

.93 

.95 

1.65 

*p  < .05. 

**p<  .01. 
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